DWT and LPC based feature extraction methods for isolated word recognition

نویسندگان

  • Navnath S. Nehe
  • Raghunath S. Holambe
چکیده

In this article, new feature extraction methods, which utilize wavelet decomposition and reduced order linear predictive coding (LPC) coefficients, have been proposed for speech recognition. The coefficients have been derived from the speech frames decomposed using discrete wavelet transform. LPC coefficients derived from subband decomposition (abbreviated as WLPC) of speech frame provide better representation than modeling the frame directly. The WLPC coefficients have been further normalized in cepstrum domain to get new set of features denoted as wavelet subband cepstral mean normalized features. The proposed approaches provide effective (better recognition rate), efficient (reduced feature vector dimension), and noise robust features. The performance of these techniques have been evaluated on the TI-46 isolated word database and own created Marathi digits database in a white noise environment using the continuous density hidden Markov model. The experimental results also show the superiority of the proposed techniques over the conventional methods like linear predictive cepstral coefficients, Mel-frequency cepstral coefficients, spectral subtraction, and cepstral mean normalization in presence of additive white Gaussian noise.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Isolated Word Recognition Using Very Simple Recurrent Neural Network Plus and Noise Compensation LPC Analysis

In this paper, the isolated word recognition using very simple recurrent neural network plus (VSRN+) and noise compensation linear predictive coding (LPC) analysis is proposed. In the proposed system, a feature extraction based on noise compensation LPC analysis achieves accurate speech feature vector in noisy environment. Since VSRN+ is utilized as a recognizer, the proposed system is able to ...

متن کامل

Wavelet Transform Speech Recognition Using Vector Quantization, Dynamic Time Warping and Artificial Neural Networks

In this paper we investigate the performance of the Discrete Wavelet Transform (DWT) with Dynamic Time Warping, Vector Quantization and Artificial Neural Networks for speaker-dependent, isolated word recognition. Wavelet Transforms have demonstrated good time-frequency localization properties and are appropriate tools for the analysis of non-stationary signals like speech. Moreover, unlike LPC,...

متن کامل

Comparative Study of MFCC And LPC Algorithms for Gujrati Isolated Word Recognition

The study performs feature extraction for isolated word recognition using Mel-Frequency Cepstral Coefficient (MFCC) for Gujarati language. It explains feature extraction methods MFCC and Linear Predictive Coding (LPC) in brief. The paper compares the performances of MFCC and LPC features under Vector Quantization (VQ) method. The dataset comprising of males and females voices were trained and t...

متن کامل

Feed Forward Back Propagation Neural Network Method for Arabic Vowel Recognition Based on Wavelet Linear Prediction Coding

A novel vowel feature extraction method via hybrid wavelet and linear prediction coding (LPC) is presented here. The proposed Arabic vowels recognition system is composed of very promising techniques; wavelet transform (WT) with linear prediction coding (LPC) for feature extraction and feed forward backpropagation neural network (FFBPNN) for classification. Trying to enhance the recognition pro...

متن کامل

On the Use of the Formant Features in the Dynamic Time Warping Based Recognition of Isolated Words

A possibility to use the formant features (FF) in the user-dependent isolated word recognition has been investigated. The word recognition was performed using a dynamic time-warping technique. Several methods of the formant feature extraction were compared and a method based on the singular prediction polynomials has been proposed for the recognition of isolated words. Recognition performance o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • EURASIP J. Audio, Speech and Music Processing

دوره 2012  شماره 

صفحات  -

تاریخ انتشار 2012